A Single Pass Heuristic Search for Segmental Speech Recognizers
Abstract
The continuous speech recognition problem is usually modeled as a search for the best path in a network of transitions between states. A full search can be very expensive in terms of computation and storage requirements. Adopting a segment-based rather than a frame-based approach already reduces these requirements, but the reduction may still be insufficient to allow for real-time recognition. For our segment-based Neural Network / Dynamic Programming hybrid, we have therefore developed a heuristic search method that performs the search in a single forward pass. The key problem was to identify a suitable heuristic function that estimates the score of the best path yet to be determined. We found that a simple heuristic function that takes into account an average path score per segment does very well. Even if the admissible loss in recognition accuracy is kept small, our heuristic search method outperforms a traditional Viterbi beam search algorithm.
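The abstract does not spell out the form of the heuristic, so the following is only one possible reading and not the authors' algorithm: a best-first search over segment boundaries in which each partial path is ranked by its score so far plus an estimate of the remaining score. The estimate here is an average score per segment multiplied by the expected number of remaining segments. The names `heuristic_search`, `seg_score`, `avg_seg_score`, and `avg_seg_len` are hypothetical and chosen for illustration.

```python
import heapq


def heuristic_search(num_frames, seg_score, max_seg_len, avg_seg_score, avg_seg_len):
    """Best-first search over segmentations of frames [0, num_frames).

    seg_score(start, end) returns a log-score (higher is better) for the
    segment covering frames start..end-1. All parameters are illustrative
    assumptions, not taken from the paper.
    """

    def estimate(t):
        # Assumed heuristic: expected number of remaining segments
        # times an average score per segment.
        return avg_seg_score * (num_frames - t) / avg_seg_len

    # heapq is a min-heap, so priorities are stored negated.
    heap = [(-estimate(0), 0, 0.0, ())]
    expanded = set()
    while heap:
        _, t, score, path = heapq.heappop(heap)
        if t == num_frames:
            return score, list(path)
        if t in expanded:
            # Each boundary is expanded once. The estimate is not guaranteed
            # to be admissible, so the result may be suboptimal.
            continue
        expanded.add(t)
        last_end = min(t + max_seg_len, num_frames)
        for end in range(t + 1, last_end + 1):
            new_score = score + seg_score(t, end)
            priority = -(new_score + estimate(end))
            heapq.heappush(heap, (priority, end, new_score, path + ((t, end),)))
    return None
```

Because every boundary is expanded at most once and the search stops at the first complete path, the work is done in a single forward pass. The price is that the returned path is only as good as the estimate allows, which corresponds to the small admissible loss in accuracy the abstract mentions.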
Similar resources
Sequence Prediction with Neural Segmental Models
Segments that span contiguous parts of inputs, such as phonemes in speech, named entities in sentences, actions in videos, occur frequently in sequence prediction problems. Segmental models, a class of models that explicitly hypothesizes segments, have allowed the exploration of rich segment features for sequence prediction. However, segmental models suffer from slow decoding, hampering the use...
Risk Based Lattice Cut Segmental Minimum Bayes-r
Minimum Bayes-Risk (MBR) speech recognizers have been shown to give improvements over the conventional maximum a-posteriori probability (MAP) decoders through N-best list rescoring and A* search over word lattices. Segmental MBR (SMBR) decoders simplify the implementation of MBR recognizers by segmenting the N-best lists or lattices over which the recognition is performed. We present a lattice c...
Confidence based lattice segmentation and minimum Bayes-risk decoding
Minimum Bayes Risk (MBR) speech recognizers have been shown to yield improvements over the conventional maximum a-posteriori probability (MAP) decoders in the context of N-best list rescoring and A* search over recognition lattices. Segmental MBR (SMBR) procedures have been developed to simplify implementation of MBR recognizers, by segmenting the N-best list or lattice, to reduce the size of the...
Using Missing Feature Theory to Actively Select Features for Robust Speech Recognition with Interruptions, Filtering, and Noise*
Speech recognizers trained with quiet wide-band speech degrade dramatically with high-pass, low-pass, and notch filtering, with noise, and with interruptions of the speech input. A new and simple approach to compensate for these degradations is presented which uses mel-filter-bank (MFB) magnitudes as input features and missing feature theory to dynamically modify the probability computations pe...
Using missing feature theory to actively select features for robust speech recognition with interruptions, filtering and noise KN-37
Speech recognizers trained with quiet wide-band speech degrade dramatically with high-pass, low-pass, and notch filtering, with noise, and with interruptions of the speech input. A new and simple approach to compensate for these degradations is presented which uses mel-filter-bank (MFB) magnitudes as input features and missing feature theory to dynamically modify the probability computations pe...
Publication date: 2008